Smart Paper Notes

Incremental Recursive Ranking Grouping for Large Scale Global Optimization

Marcin Michal Komarnicki , Michal Witold Przewozniczek , Halina Kwasnicka

Category: Neural and Evolutionary Computing

2022-06-08

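Real-world optimization problems may have different underlying structures. In black-box optimization, the dependencies between decision variables remain unknown, but some techniques can discover such interactions accurately. In Large Scale Global Optimization (LSGO), problems are high-dimensional. Decomposing LSGO problems into subproblems and optimizing them separately has been shown to be effective. The effectiveness of this approach may depend heavily on the accuracy of the problem decomposition. Many state-of-the-art decomposition strategies are derived from Differential Grouping (DG). However, if a given problem consists of non-additively separable subproblems, their ability to detect only true interactions may decrease significantly. Therefore, we propose Incremental Recursive Ranking Grouping (IRRG), which does not suffer from this flaw. IRRG consumes more fitness function evaluations than recent DG-based propositions such as Recursive DG 3 (RDG3). Nevertheless, for problems with additively separable subproblems, which suit RDG3, the effectiveness of the considered Cooperative Co-evolution frameworks after embedding IRRG or RDG3 was similar. After replacing additive separability with non-additive separability, however, embedding IRRG leads to results of significantly higher quality.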
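The kind of interaction test that DG-style decomposition relies on can be sketched as follows. This is a minimal illustration of the classic pairwise Differential Grouping check, not of IRRG or RDG3; the function names, the perturbation size `delta`, the tolerance `eps`, and both toy objectives are assumptions made for this example.

```python
def interacts(f, base, i, j, delta=1.0, eps=1e-9):
    """Pairwise additive-separability check in the style of Differential Grouping.

    Variables i and j are reported as interacting when the change in f caused
    by perturbing x_i depends on the value of x_j.
    """
    x_i = list(base)
    x_i[i] += delta
    d1 = f(x_i) - f(base)       # effect of moving x_i at the base point

    x_j = list(base)
    x_j[j] += delta
    x_ij = list(x_j)
    x_ij[i] += delta
    d2 = f(x_ij) - f(x_j)       # effect of moving x_i after x_j was moved

    return abs(d1 - d2) > eps


def partially_separable(x):
    # {x0, x1} form one non-separable group; x2 is additively separable from it.
    return x[0] * x[1] + x[2] ** 2


def multiplicative(x):
    # x0 and x1 can be minimized independently (both factors are positive),
    # but the two parts are combined by a product rather than a sum.
    return (x[0] ** 2 + 1.0) * (x[1] ** 2 + 1.0)


print(interacts(partially_separable, [1.0, 1.0, 1.0], 0, 1))  # True
print(interacts(partially_separable, [1.0, 1.0, 1.0], 0, 2))  # False
print(interacts(multiplicative, [1.0, 1.0], 0, 1))            # True
```

The last call shows the failure mode the abstract describes: the additive check flags an interaction between two variables that could in fact be optimized separately, because the subproblems are separable only in a non-additive sense.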

Translated by Google Translate

Context based lemmatizer for Polish language

Michal Karwatowski , Marcin Pietron

Category: Natural Language Processing | Artificial Intelligence

2022-07-23

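Lemmatization is the process of grouping together the inflected forms of a word so that they can be analysed as a single item, identified by the word's lemma, or dictionary form. In computational linguistics, lemmatization is the algorithmic process of determining the lemma of a word based on its intended meaning. Unlike stemming, lemmatization depends on correctly identifying the intended part of speech and meaning of a word in a sentence, as well as within the larger context surrounding that sentence. As a result, developing an efficient lemmatization algorithm is a complex task. In recent years, deep learning models used for this task have been observed to outperform other methods, including machine learning algorithms. In this paper, a Polish lemmatizer based on the Google T5 model is presented. The model was trained with different context lengths. It achieves the best results for the Polish language lemmatization process.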

Defense Against Adversarial Attacks on Audio DeepFake Detection

Piotr Kawa , Marcin Plata , Piotr Syga

Category: Machine Learning

2022-12-30

Audio DeepFakes are artificially generated utterances created using deep learning methods with the main aim of fooling listeners; most such audio is highly convincing. Their quality is sufficient to pose a serious threat to security and privacy, for example by undermining the reliability of news or enabling defamation. To counter these threats, multiple neural network-based methods for detecting generated speech have been proposed. In this work, we cover the topic of adversarial attacks, which decrease the performance of detectors by adding superficial changes (difficult for a human to spot) to input data. Our contribution consists of evaluating the robustness of 3 detection architectures against adversarial attacks in two scenarios (white-box and using a transferability mechanism) and then enhancing it through adversarial training performed with our novel adaptive training method.

Detection of out-of-distribution samples using binary neuron activation patterns

Bartlomiej Olber , Krystian Radlak , Adam Popowicz , Michal Szczepankiewicz , Krystian Chachula

Category: Machine Learning

2022-12-29

Deep neural networks (DNNs) have outstanding performance in various applications. Despite numerous efforts of the research community, out-of-distribution (OOD) samples remain a significant limitation of DNN classifiers. The ability to identify previously unseen inputs as novel is crucial in safety-critical applications such as self-driving cars, unmanned aerial vehicles and robots. Existing approaches to detecting OOD samples treat a DNN as a black box and assess the confidence score of the output predictions. Unfortunately, this method frequently fails, because DNNs are not trained to reduce their confidence for OOD inputs. In this work, we introduce a novel method for OOD detection. Our method is motivated by a theoretical analysis of neuron activation patterns (NAP) in ReLU-based architectures. The proposed method does not introduce a high computational workload, thanks to the binary representation of the activation patterns extracted from convolutional layers. The extensive empirical evaluation demonstrates its high performance on various DNN architectures and seven image datasets.
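The idea of a binary activation pattern can be illustrated with a small sketch. The abstract does not spell out the scoring rule, so the Hamming distance to the nearest stored pattern used below is an assumption for illustration only, as are the function names and the toy activation values.

```python
def binary_pattern(activations):
    """Binarize ReLU outputs: 1 where the neuron fired, 0 where it did not."""
    return tuple(1 if a > 0 else 0 for a in activations)


def hamming(p, q):
    """Number of positions at which two equal-length patterns differ."""
    return sum(a != b for a, b in zip(p, q))


def ood_score(activations, known_patterns):
    """Distance from the input's pattern to the closest pattern seen in training.

    Assumed scoring rule: a larger value means the input looks less familiar.
    """
    pattern = binary_pattern(activations)
    return min(hamming(pattern, k) for k in known_patterns)


# Hypothetical activations recorded from one layer on two training inputs.
known = {
    binary_pattern([0.5, 0.0, 1.2, 0.0]),
    binary_pattern([0.0, 2.0, 0.0, 0.1]),
}

print(ood_score([0.3, 0.0, 0.9, 0.0], known))  # 0: pattern already seen
print(ood_score([0.7, 1.1, 0.4, 0.6], known))  # 2: differs from every stored pattern
```

Because the patterns are plain bit vectors, storing and comparing them is cheap, which is consistent with the low computational overhead the abstract claims.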

Adapting to game trees in zero-sum imperfect information games

Côme Fiegel , Pierre Ménard , Tadashi Kozuno , Rémi Munos , Vianney Perchet , Michal Valko

Category: Machine Learning (Statistics) | Machine Learning

2022-12-23

Imperfect information games (IIG) are games in which each player only partially observes the current game state. We study how to learn $\epsilon$-optimal strategies in a zero-sum IIG through self-play with trajectory feedback. We give a problem-independent lower bound $\mathcal{O}(H(A_{\mathcal{X}}+B_{\mathcal{Y}})/\epsilon^2)$ on the required number of realizations to learn these strategies with high probability, where $H$ is the length of the game, and $A_{\mathcal{X}}$ and $B_{\mathcal{Y}}$ are the total numbers of actions for the two players. We also propose two Follow the Regularized Leader (FTRL) algorithms for this setting: Balanced-FTRL, which matches this lower bound but requires knowledge of the information set structure beforehand to define the regularization; and Adaptive-FTRL, which needs $\mathcal{O}(H^2(A_{\mathcal{X}}+B_{\mathcal{Y}})/\epsilon^2)$ plays without this requirement by progressively adapting the regularization to the observations.
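As a quick check of how the two rates compare, dividing the Adaptive-FTRL rate by the Balanced-FTRL rate (which matches the lower bound) gives the following, ignoring constants and any factors hidden by the $\mathcal{O}$ notation:

$$\frac{H^2(A_{\mathcal{X}}+B_{\mathcal{Y}})/\epsilon^2}{H(A_{\mathcal{X}}+B_{\mathcal{Y}})/\epsilon^2} = H$$

Under the stated rates, the price of not knowing the information set structure in advance is therefore a factor of $H$ in the number of plays.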

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

Category: Computer Vision | Machine Learning

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical image analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% of challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants, and only 50% of the participants performed ensembling, based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

Fine-grained Czech News Article Dataset: An Interdisciplinary Approach to Trustworthiness Analysis

Matyáš Boháček , Michal Bravanský , Filip Trhlík , Václav Moravec

Category: Natural Language Processing

2022-12-16

We present the Verifee Dataset: a novel dataset of news articles with fine-grained trustworthiness annotations. We develop a detailed methodology that assesses the texts based on their parameters encompassing editorial transparency, journalistic conventions, and objective reporting while penalizing manipulative techniques. We bring aboard a diverse set of researchers from social, media, and computer sciences to overcome barriers and limited framing of this interdisciplinary problem. We collect over $10,000$ unique articles from almost $60$ Czech online news sources. These are categorized into one of the $4$ classes across the credibility spectrum we propose, ranging from entirely trustworthy articles all the way to the manipulative ones. We produce detailed statistics and study trends emerging throughout the set. Lastly, we fine-tune multiple popular sequence-to-sequence language models using our dataset on the trustworthiness classification task and report the best testing F-1 score of $0.52$. We open-source the dataset, annotation methodology, and annotators' instructions in full length at https://verifee.ai/research to enable easy build-up work. We believe similar methods can help prevent disinformation and educate in the realm of media literacy.

Fast-moving object counting with an event camera

Kamil Bialik , Marcin Kowalczyk , Krzysztof Blachut , Tomasz Kryjak

Category: Computer Vision

2022-12-16

This paper proposes the use of an event camera as a component of a vision system that enables counting of fast-moving objects - in this case, falling corn grains. This type of camera transmits information about changes in the brightness of individual pixels and is characterised by low latency, no motion blur, correct operation in different lighting conditions, and very low power consumption. The proposed counting algorithm processes events in real time. The operation of the solution was demonstrated on a stand consisting of a chute with a vibrating feeder, which allowed the number of falling grains to be adjusted. The objective of the control system with a PID controller was to maintain a constant average number of falling objects. The proposed solution was subjected to a series of tests to verify that the developed method operates correctly. Based on these tests, the validity of using an event camera to count small, fast-moving objects can be confirmed, along with the associated wide range of potential industrial applications.
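The control loop mentioned above can be sketched with a textbook discrete PID controller. This is only an illustration of the general technique; the abstract gives no controller details, so the class name, the gains, the set point, and the measured grain counts below are all hypothetical.

```python
class PID:
    """Textbook discrete PID controller (illustrative, not the paper's code)."""

    def __init__(self, kp, ki, kd, setpoint):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint      # desired average number of falling grains
        self.integral = 0.0
        self.prev_error = None

    def update(self, measurement, dt):
        """Return the control signal for one time step of length dt."""
        error = self.setpoint - measurement
        self.integral += error * dt
        if self.prev_error is None:
            derivative = 0.0          # no history on the first step
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


# Hypothetical gains; in a real system the measurement would come from the counter.
pid = PID(kp=2.0, ki=0.5, kd=0.0, setpoint=10.0)
first = pid.update(measurement=8.0, dt=1.0)    # too few grains: raise feeder drive
second = pid.update(measurement=10.0, dt=1.0)  # on target: only the integral term remains
print(first, second)  # 5.0 1.0
```

In a setup like the one described, the output would presumably drive the vibrating feeder, with the event-based counter supplying `measurement` at each step.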

Synthetic Image Data for Deep Learning

Jason W. Anderson , Marcin Ziolkowski , Ken Kennedy , Amy W. Apon

Category: Computer Vision | Machine Learning

2022-12-12

Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification and semantic segmentation models. In this work, we explore how high-quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90% of the model training to be front-loaded.
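The mean prediction IoU used as the metric above can be computed as in the following minimal sketch for binary segmentation masks. The function names and the toy masks are assumptions for illustration, and treating two empty masks as a perfect match is a convention chosen here, not something stated in the abstract.

```python
def iou(pred, target):
    """Intersection over union of two binary masks given as nested lists of 0/1."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Convention (assumed): two empty masks count as a perfect match.
    return inter / union if union else 1.0


def mean_iou(preds, targets):
    """Average IoU over a set of (prediction, ground truth) mask pairs."""
    scores = [iou(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]

print(iou(pred, target))                             # 0.5 (2 shared pixels, 4 in the union)
print(mean_iou([pred, target], [target, target]))    # 0.75
```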

Machine intuition: Uncovering human-like intuitive decision-making in GPT-3.5

Thilo Hagendorff , Sarah Fabi , Michal Kosinski

Category: Natural Language Processing | Artificial Intelligence | Machine Learning

2022-12-10

Artificial intelligence (AI) technologies are revolutionizing vast fields of society. Humans using these systems are likely to expect them to work in a potentially hyperrational manner. However, in this study, we show that some AI systems, namely large language models (LLMs), exhibit behavior that strikingly resembles human-like intuition - and the many cognitive errors that come with it. We use a state-of-the-art LLM, namely the latest iteration of OpenAI's Generative Pre-trained Transformer (GPT-3.5), and probe it with the Cognitive Reflection Test (CRT) as well as semantic illusions that were originally designed to investigate intuitive decision-making in humans. Our results show that GPT-3.5 systematically exhibits "machine intuition," meaning that it produces incorrect responses that are surprisingly similar to how humans respond to the CRT as well as to semantic illusions. We investigate several approaches to test how robust GPT-3.5's inclination toward intuitive-like decision-making is. Our study demonstrates that investigating LLMs with methods from cognitive science has the potential to reveal emergent traits and adjust expectations regarding their machine behavior.
